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Exploiting the algebraic structure of the set of bimatrix games, a divide-and-conquer algo- 
rithm for finding Nash equilibria is proposed. The algorithm is fixed-parameter tractable 
with the size of the largest irreducible component of a game as parameter. An implemen- 
tation of the algorithm is shown to yield a significant performance increase on inputs with 
small parameters. 

1 Introduction 

A bimatrix game is given by two matrices (A, B) of identical dimensions. The first player picks 
a row i, the second player independently picks a column j. As a consequence, the first player 
receives the payoff Aij, the second player Bij. Both player are allowed to randomize over their 
choices, and will strive to maximize their expected payoff. A Nash equilibrium is a pair of 
strategies, such that no player can improve her expected payoff by deviating unilaterally. 

If the payoff matrices are given by natural numbers, then there always is a Nash equilibrium 
using only rational probabilities. The computational task to find a Nash equilibrium of a bima- 
trix game is complete for the complexity class PPAD |15|, [5| . PPAD is contained in FNP, and 
commonly believed to exceed FP. In particular, it is deemed unlikely that a polynomial-time 
algorithm for finding Nash equilibria exists. 

The next-best algorithmic result to hope for could be a fixed-parameter tractable (fpt) al- 
gorithm [6l |9], that is an algorithm running in time f{k)p(n) where n is the size of the game, 
p a polynomial and k a parameter. For such an algorithm to be useful, the assumption the pa- 
rameter were usually small needs to be sustainable. The existence of fpt algorithms for finding 
Nash equilibria with various choices of parameters has been studied in [3 ITTl |8| . 

In the present paper we demonstrate how products and sums of games - and their inverse 
operations - can be used to obtain a divide-and-conquer algorithm to find Nash equilibria. 
This algorithm is fpt, if the size of the largest component not further dividable is chosen as a 
parameter. Products of games were introduced in [17] as a means to classify the Weihrauch- 
degree [H El H [T2] of finding Nash equilibria for real- valued payoff matrices. Sums appear 
originally in the PhD thesis |18j of the second author; the algorithm we discuss was implemented 
in the Bachelor's thesis |13j of the first author. 

2 Products and Sums of Games 

Both products and sums admit an intuitive explanation: The product of two games corresponds 
to playing both games at the same time, while the sum involves playing matching pennies to 
determine which game to play with one player being rewarded and the other one punished in 
the case of a failure to agree. 
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2.1 Products 



In our definition of products, we let [ , ] : {1, . . . , n} x {1, . . . , m} — > {1, . . . , nm} denote the 
usual bijection = (i — l)n + j. The relevant values of n,m will be clear from the context. 
We point out that [ , ] is polynomial-time computable and polynomial-time invertible. 
Definition 1. Given an n\ x m\ bimatrix game (^l 1 ,^ 1 ) and an ri2 x mi bimatrix game 
(A 2 ,B 2 ), we define the (nin 2 ) x (mim 2 ) product game (A 1 ,!! 1 ) x (A 2 ,B 2 ) as (A, B) with 

A lh,i2]lh,h] = A hn + A lj 2 and B [hM\juh] = B hh + B lj 2 - 

Theorem 2. If (x k ,y k ) is a Nash equilibrium of (A k ,B k ) for both k G {0, 1}, then (x,y) is a 
Nash equilibrium of (A, B), where x [ilh] = x\x\ and y [mim2 ] = y mi y m . 2 - 

Proof. We will prove that x is a best response to y, if x fe is a best response to y fe for both k G 
{0, 1}, the remaining part is analogous. By applying the following equivalence transformation 

(mn2) (77117712) 
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on both sides of the best response condition 

771772 77717772 7*1 772 77717772 

^ ^ x i A i,jVj - 5Z 5Z X i A ijyj 

7=1 J=l 7=1 J=l 

one obtains the following form for the best response condition: 

771 7771 

E E X ilAl,Ji%'l 
71=1 jl = l 



+ 


772 7772 

E E 
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77 2 777 2 


+ 


E E 
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771 7771 

E E x i\ A i^j\Vh 

71=1 jl=l 

As this is just the sum of the best response conditions for the individual games {A X ,B 2 ) and 
{A 2 , B 2 ), the claim follows. □ 



n 2 

Theorem 3. If (x,y) is a Nash equilibrium of (A,B), then (x l ,y l ) given by x\ = ^ x^j and 

'=1 

7772 

yj = E V\j,l\ i s a Nash equilibrium of (A 1 , B 1 ). 
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Proof. Again the proof uses contraposition. Assume w.l.o.g. that x 1 is a better response against 
y 1 than x 1 , that is: 

ni mi ni mi 

i=l p=l i=l j=l 

n 2 mj 

Add E E x 2 A 2 -y 2 on both sides, and apply the reverse of the transformation used in the proof 
»=ij=l 

of Theorem [2j Then one obtains: 

niri2 m\m,2 n\ri2 m\m,2 

5^ XiAijVj > XiAijljj 
i=l j=l i=X j=X 

with x defined via aSfe,^] = x^x^. This contradicts the assumption that x would be a best 
response against y, so (x, y) cannot be a Nash equilibrium. □ 

2.2 Sums 

The sum of games involves another parameter besides the two component games, which just is 
a number exceeding the absolute value of all the payoffs. 

Definition 4. Given an n% x mi bimatrix game (A , Br) and an ri2 x ni2 bimatrix game 
(A 2 ,B 2 ), we define the {n\ + ri2) x (mi + m%) sum game {A l ,B v ) + (A 2 ,B 2 ) via the constant 
A' > m&Xij{\Aij\, Bij\} as (^4, -B) with: 



Ajj if i < nx, j < mi 

4-ni),(j-mi) if«>ni,j>mi 

AT otherwise 

' if i < ni , j < mx 

5 ( 2 i-m),0-mi) iii>nx,j>mx 

—K otherwise 



Lemma 5. Let (x,y) be a Nash equilibrium of (A 1 ^ 1 ) + (A 2 , B 2 ). Then < (E^Li 3 ^) < 1 
andO< (ESi%) <1- 



Proof. Assume (x, y) is a Nash equilibrium. The following circular reasoning demonstrates that 
any of the forbidden cases yields a contradiction. 

1. If = 52?=i x «' tnen also = Y2J=x Vj- 

If y, > for any j < mx, then y' defined via y'j = 0, y' mi+l = y mi +i + Vj and y[ = yi for 
I ^ j, mx + 1 is a better response against x than y: The payoff difference between y' and y 
52i 2 =xi B i2,mi+i K)x ril J r i 2 J , and by choice of A eveiy (-Bj 2)mi _|_i + AT) is positive. 

2. IfO = E J m = 1 i%-,then Er=i^ = l- 

If Xj > for any i > nx, then x' defined via x\ = 0, x'i = xi + x« and x\ = x\ for 
/ 7^ l,i is a better response against y than x: The payoff difference between x' and y is 
i( ^i-nu'2 K)y mi +j2) i an d by choice of A" every ( Af_ ni j 2 + A) is positive. 
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3 - If E"=i x i = 1 i then Epi J/j = !• 

The proof proceeds as in 1. via symmetry. 



4- if£piy; = MhenO = £^i 



X7, . 



The proof proceeds as in 2. via symmetry. 

□ 

Theorem 6. If (x,y) is a Nash equilibrium of (A 1 ^ 1 ) + (A 2 ,B 2 ), then a Nash equilibrium 

(x ,y ) of (^l 1 ,-^? 1 ) can be obtained as xj = — and y] = — . 

* E;=i n J E;=i vi 

Proof. By LemmaJHJ (x 1 ,?/ 1 ) is well-defined, and clearly a strategy profile. W.l.o.g. we assume 
that x 1 is not a best response against y 1 , and derive a contradiction. Let x 1 be a better response 
against y l than x . Define x via x~j = xj (S"=i x f° r * — ni ' an< ^ x * = ^ otherwise. We claim 
that x is a better response against y than x. The payoff difference between x and x for the 
first player can readily be computed to be (XT=i x i) £Ti=i £ji=i ^hji (X\ ~~ x i 1 )Ui2- Up to the 
positive factor [(X^r=i x (Sfc^i^)] > this * s ec L ua l to the payoff difference between x 1 and 
x , hence the former is positive iff the latter is. □ 

Theorem 7. Let (x k , y k ) be a Nash equilibrium of (A k , B k ) resulting in payoffs (P k , Q k ) for both 
k £ {1,2}. Then (x,y) is a Nash equilibrium of (^4 1 ,i? 1 ) + (A 2 ,B 2 ), where 

for i < m, Xi = xf_ m 2K K _q?_ Q 2 for i>nx, yj = y] 2K * P C p2 for j < mi, = y?_ mi 2 k_~ p T_ P 2 
for j > mi. 



Proof. In the given situation, assume (x,y) were not a Nash equilibrium. W.l.o.g., let this be 
due to x not being a best response to y. This is equivalent to the existence of some i with x« > 0, 
but the pure strategy i is not a best response to y. The latter means there is a better response 
k to y, i.e.: 

Case i < ni,k < n\ 



^,1 1 JT-P 2 , ^„ a JT-P 1 

A knVh nff _ pi _ p2 I + I A »J2 OR' _ PI _ P2 



2/V - /" - /'- / I ^ "* 2 2A - /" - /> 
ii=l / V?2=l 



\ii=l / \j'2=i 



Subtracting fc^Li K Vj 2 2 k-p^-p 2 ) 011 both sides, then dividing by 2X ^ p f^ p2 shows that 
A; is a better response against y 1 than i in the game (A l ,B x ). But then xj = follows, 
hence x% = in contradiction to the assumption. 

Case j < m, k > n\ 



( mi 1 K — P 2 \ ( ™ 2 1 2 
I] 2A" - pi - P 2 + £ 

\J1=1 ' / \J2=1 



1)32 2K - P 1 - P 2 



> £ ^iWi! 9y _pl_p2 + £ *v! 



2K-P 1 -P 2 I ' \ ^ " aj2 2K- P 1 - P 2 

Jl=l / V72=l 
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As they are sums over stochastic vectors, we find [Y^,™ 1 Uj-A = ^JL ) = Moreover, 

Xi 7^ implies x| 7^ 0, and this in turn implies [Y^Li ^Ij-JJj^j = P 1 ■ Hence, after 
multiplying both sides by (2K — P 1 — P 2 ) the previous inequality simplifies to: 

K(K-P 2 )+ (jpAl j2 ylj {K-P 1 )>P\K~P 2 )+K{K-P 1 ) 

This in turn can be simplified to [Y^Li A^faVj^) > P 2 , which contradicts the assumption 

P 2 were the optimal payoff achievable by player 1 against y 2 . 
Case i > m, k < n\ Analogous to Case i < n%, k > m 
Case i > m, k > n\ Analogous to Case i < m,k < n\ 

□ 



3 The algorithm 

Our basic algorithm proceeds as follows: To solve a game (A, B) 

1. test whether (^4, B) is the sum of (A 1 , B 1 ) and (A 2 , B 2 ) via some constant K. If yes, solve 
(A \B ) and (A 2 ,B 2 ) and combine their Nash equilibria to an equilibrium of (A,B) via 
Theorem [7j If no, 

2. test whether (A, B) is the product of (A l ,B l ) and (A 2 ,B 2 ). If yes, solve (A 1 ^ 1 ) and 
(A 2 , B 2 ) and combine their Nash equilibria to an equilibrium of (A, B) via Theorem [2j If 
no, 

3. find a Nash equilibrium of (A, B) by some other means. 

For some n x m game (A, B) let let S(A, B) denote its size, i.e. S(A, B) = nm, and let 
X(A, B) be the size of the largest game for which 3. in our algorithm is called. Let f(k) be the 
time needed for the external algorithm called in 3. on a game of size k. Then the runtime of our 
algorithm is bounded by 0(5 3 /(A)), in particular, it is an /pt-algorithm: 

Testing whether a game is a sum, and computing the components, if applicable, can be done 
in linear time. The sum of the sizes of the components is less than the size of the original game. 
Finally, combining Nash equilibria can be done in linear time, too. 

Whether a game is a product of factors of a fixed size can also be tested in linear time. 
Testing the different possible factors yields quadratic time for this part. This already includes 
computing the components, and the product of the sizes of the factors is equal to size of the 
original game. Again, combining the Nash equilibria takes linear time. 

As a slight modification of our algorithm, one can eliminate (iteratively) strictly dominated 
strategies at each stage of the algorithm. We recall that a strategy i of some player is called 
strictly dominated by some other strategy j, if against any strategy chosen by the opponent, 
i provides its player with a strictly better payoff than j. A strictly dominated strategy can 
never be used in a Nash equilibrium. It is easy to verify that a game decomposable as a sum 
never has any strictly dominated strategies, but may occur as the result of the elimination of 
such strategies. Hence, including an elimination step for each stage increasing the potential for 
decomposability. Elimination of strictly dominated strategies commutes with decomposition of 
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products, i.e. the reduced from of the product is the product of the reduced forms of the factors. 
The algorithm remains fpt if such a step is included. A detailed investigation of complexity 
issues regarding removal of dominated strategies can be found in [16] . 

4 Empirical evaluation 

Only a small fraction of the bimatrix games of a given size and bounded integer payoffs will be 
decomposable by our techniques, this limiting the applicability of the algorithm in Section [3} 
However, to some extent we can expect patterns in the definitions of real-world game situation 
to increase the decomposability of the derived bimatrix games. For example, the structure of 
Poker-style games implies decomposability, as can be concluded from the considerations in |10j . 

To obtain a first impression whether using the decomposition algorithm is indeed beneficial, 
a collection of 100 random decomposable games was created. Each game has 95-105 strategies 
per player, and payoff values range from to 50. The decomposability was ensure by creating 
a random tree representing the relevant decomposition structure first, using probabilities of 0.4 
each for sum and product decomposition, and of 0.2 for an elimination of strictly dominated 
strategies step. The height of the trees was limited to 80, additionally vertices corresponding to 
games of size up to 6 were turned into leaves. At the leaves, the payoffs were chosen uniformly 
subject to the constraints derived from the structure and the overall constraint of payoff values 
being between and 50. Finally, the corresponding bimatrix games were computed. 

Both as a benchmark, and in order to compute Nash equilibria of the irreducible component 
games, the tool Gambit [T4] was used. Gambit offers a variety of algorithm for computing 
Nash equilibria of bimatrix games, we used: 

1. gambit-enummixed: using extreme point enumeration 

2. gambit-gnm: using a global Newton method approach 

3. gambit-lcp: using linear complementarity 

4. gambit-simpdiv: using simplicial subdivision 

Figures 1.-4. show for each of the Gambit algorithms how many of our decomposable 
example games could be solved in some given time bound (per game, not total) using only the 
Gambit algorithm directly, or exploiting decomposition implemented in C+- 1- first. Despite the 
fact that our decomposition algorithm was not optimized, it turned out that using decomposition 
almost all games could be solved in under 3 seconds, whereas even gambit-gnm as the fastest 
Gambit algorithm on the sample took 30 seconds for a similar feat. Thus, there is clear 
indication that on suitable data, exploiting the algebraic structure underlying the decomposition 
algorithm yields a significant increase in performance. 
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Figure 1: gambit-enummixed 
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